N89 - 10203 


5 / 7 - 6 / 

62 3 


TDA Progre^p Report 42-94 



- U 




ApriKJune 1988 



A Software Simulation Study of the Long Constraint 
Length VLSI Viterbi Decoder 

S. Arnold and F. Pollara 
Communications Systems Research Section 


A software simulation of long constraint length Viterhi decoders has been developed . 
This software closely follows the hardware architecture that has been chosen for the 
VLSI implementation. The program is used to validate the design of the decoder and to 
generate test vectors for the VLSI circuits. 


I. Introduction 

Convolutional codes have been used on deep space probes 
for several years. During the last few years, TDA Advanced 
Systems undertook a research effort [1] to develop advanced 
coding techniques capable of gaining an additional 2 dB over 
the present performance of deep space missions. Current cod- 
ing systems are based on a K = 7, r = 1/2 convolutional code 
concatenated with an 8-bit (255,223) Reed-Solomon (RS) 
code, where K is the constraint length and r is the code rate. 

The main result of this research effort was the discovery of 
new convolutional codes with K = 1 5 and r = 1/6 which exceed 
the 2-dB goal when concatenated with a 10-bit (1023,959) RS 
code. Recently, the delay imposed on the Galileo mission 
introduced the possibility of including a K = 15, r = 1/4 code 
in this mission. This experimental code [2] will gain approxi- 
mately 1.5 dB over the current NASA-standard code. The 
Galileo experiment, together with the potential offered by 
these coding gains for future missions, has led to an effort to 
build a VLSI-based Viterbi decoder capable of decoding codes 


with K up to 15 and r = 1 /«, n = 2,3 ,4,5 ,6, at speeds approach- 
ing 1 Mbit/s. 1 

II. Decoder Architecture 

The complexity of a Viterbi decoder depends mainly on the 
constraint length AT, since the number of states is 2^~ 1 . The 
decoder for the new K = 15 codes is approximately 256 times 
more complex than the current MCD (Maximum-likelihood 
Convolutional Decoder) used in the DSN stations to decode 
K = 1 codes. The requirement on the information data rate 
forces the use of heavily parallel architectures. 

After evaluation of several design alternatives, it was decided 
to use a fully parallel architecture consisting of 2 K ~ 2 = 8192 
physical butterflies operating in parallel. Each butterfly uses 
bit-serial arithmetic to perform the internal operations of add- 


1 J. Statman, “Preliminary Design Review for Big Viterbi Decoder 
(BVD),” JPL internal document, March 31, 1988. 
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compare-select, since this is more suitable to fast VLSI cir- 
cuits, and represents the metrics as 16-bit numbers. Each but- 
terfly contains two states of the decoder and outputs two 
decision bits to the trace-back memory. The 8192 butterflies 
are organized in identical VLSI chips containing 32 butterflies 
each, and in 16 identical boards containing 16 chips each [3] . 

The concern was to develop a software simulation of the 
complete decoder so that (1) several new design ideas could be 
tested and validated; and (2) test vectors could be generated 
for signals at various key points in the decoder and then used 
to test the VLSI design. Given the complexity and the cost of 
this project, it was necessary to have a complete software de- 
coder that closely emulated the hardware architecture and 
demonstrated the validity of the design. 


III. Software Decoder 

The software decoder consists of a program developed on a 
SUN 3/260 workstation and written in C-language. Since the 
program runs on a sequential computer, it scans through the 
butterflies in sequential order, while the hardware performs all 
these operations in parallel. The decoder is based on the hard- 
ware design summarized in Figs. 1 and 2. 

Figure 1 represents the metric computer module present 
in each butterfly. It takes the received symbols in sign and 
magnitude representation and computes the two branch 
metrics, p and q , as two 16-bit numbers. The register denoted 
as LABEL ( is initialized at startup time and contains an appro- 
priate label for the ith butterfly. The value of this label is pro- 
vided by the module encoder , whose operation is described by 
the flow diagram of Fig. 3. Here NB represents the total num- 
ber of butterflies (8192), i is the index of the current butter- 
fly, and / the index of encoded symbols e y. First, the n en- 
coded symbols are computed for the current butterfly. Then 
LABEL i is just given by the decimal equivalent of the binary 
array (e 0 ,e l , . . . ,e 5 ). The other input to the metric computer 
module, r max , is just the sum of the magnitudes of the re- 
ceived symbols for each information bit time. Notice that the 
diagram in Fig. 1 shows six input received symbols, but it can 
be used for any code rate r = 1 /«, n = 2, 3 ,4, 5 ,6, by setting the 
unused symbols to zero. Figure 4 shows a flow diagram repre- 
senting the computations taking place in the software. The 
variable / counts the received symbols modulo n. 


The add-compare-select circuit of Fig. 2 takes the branch 
metrics p and q just computed and the previously computed 
accumulated metrics m i0 and m n from states iO and /I 
and generates the updated metrics , m y l and the decision 
bits bit Q and bit 1 , which are stored in the trace-back memory. 
This memory is organized in three banks of L bits each, where 
L is the path truncation length. Decoded bits are given by the 
trace-back performed on the bank containing the “oldest” 
decision bits. The detailed operation of the add-compare- 
select module is shown in the flow diagram of Fig. 5. The test 
for overflow is performed on the output accumulated metric 
m jQ of butterfly number zero. Renormalization occurs if the 
two most significant bits of m j0 are both equal to one. In this 
case, the most significant bit of all accumulated metrics is reset 
to zero to prevent overflow of the metrics. The decoder de- 
scribed in this article and its future VLSI implementation can 
decode any code with connection vectors G t = (x i0 ,x n , . . . , 
* ll4 ), wherex^- G (0,1) andx /0 =x /14 = 1. Code search results 
fl] , [2] show that good codes always meet the constraint of 
having a leading and trailing “1” in the connection vectors. 
Because of this constraint, only two branch metrics, p and q 9 
need to be computed. When K < 15, this constraint is no 
longer met, but it can be observed that in this case m ;1 is 
always equal to m j0 , as shown in [3]. This is accomplished 
with the switch in Fig. 2 or the test (K < 15) in Fig. 5. 

IV. Operation of the Software Decoder 

Testing a large Viterbi decoder is a complex task, since 
some programming errors may be revealed only by particular 
input sequences or error patterns. This decoder has been tested 
first against an existing software decoder for K = 7 and r = 1/2, 
which has been extensively used in the past. After it was ascer- 
tained that the two programs had identical behavior, the new 
program was tested with various other codes, and it also per- 
formed according to expectations. 

To run the program, which is reproduced in the Appendix, 
the user must enter K> the inverse n of the rate, the path trun- 
cation length Z, and the generator polynomials in octal. Also, 
the names of the files used to get the input received symbols 
and to write the decoded bits must be provided. Currently, the 
output consists of the decoded information bits and is written 
to a disk file. The test signals to be used for future testing of 
the VLSI circuits can be obtained by inserting print statements 
anywhere desired in the program. 
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Fig. 3. Encoder flow diagram 
















Fig. 5. Add-compare-select flow diagram 
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Appendix 


# include <stdio.h> 


/a********************************** VLSI ***********************************/ 

, */ 

/* 7 
/★ This program simulates the long constraint length VLSI Viterbi decoder. */ 

/* it allows the user to decode convolutional codes with constraint length */ 

/ * up to 15 and code 

rate 1/2 to 1/6. ' 

*/ 

/ ******************************************************************™******/ 

int n; 

/* rate= 1/n */ 

int L; 

/* buffer length */ 

int p,q; 

/* branch metrics */ 

int n tb; 

/* traceback addresses */ 

int time; 

/* traceback time */ 

int butt; 

/* loop counter */ 

int NS; 

/* number of states */ 

int NB; 

/* number of butterflies */ 

int k; 

/* constraint length */ 

int dec; 

/* decoding bank */ 

int miO, mil, mjO, mjl; 

/* accumulated metrics */ 

int blk time; 

/* time in traceback */ 

int bit no; 

f* number of symbols decoded */ 

int GP[6]; 

/* generator polynomials */ 

int out[100]; 

/* temp storage for decoded bits */ 

int outr[100]; 

/* storage for decoded bits */ 

int LABEL[8192]; 

/* butterfly labels */ 

int metric[16384], old_metric[ 16384]; /* accumulated metric storage */ 

char flag; 

/* renormalization flag *1 

char bitO, bitl; 

/* decision bits *1 

char RAM[16384][100][3]; 

f* traceback RAM*/ 

main () 


I 

int tb; 

f* traceback (tb) bank */ 

int Mo; 

/* parameter to calculate memory size */ 

int bank; 

/* loop counter */ 

int blk no; 

/* number of blocks decoded */ 

int blk_par; 

/* block parity (0 or 1) */ 

int n_dec; 

/* addresses of decoded bits */ 

int state; 

/* loop counter */ 

int symbol„no; 

/* loop counter */ 

int stateO, state 1; 

/* current states of butterfly */ 

int prev_statcO, prev_ 

state 1; /* previous states of butterfly */ 

int recsym[6]; 

/* received symbols (8-bit) */ 

i char decinp[10], decoutflO]; /* input and output files of decoder */ 

FILE *fpl, *fp2; 

/* input/output file pointers */ 

printf ("The simulation can decode binary data with a constraint length"); 

printf ("k < = 15 and code rate of 1/2 to 1/6"); 

printf ("Enter constraint length k"); 
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I 

I 


scanf ("%d\&k); 

printf ("Enter number of symbols n (2-6)"); 
scanf ("%d",&n); 

printf ("Enter length of traceback buffer L"); 
scanf ("%d",&L); 

for (symbol_no= 0; symbol_no< n; symboI_no+ + ) { 

printf ("Enter generating polynomial GP[%d] in OCTAL > ",symbol_no); 

scanf ("%o",&GP[symbol_no]); 

if (k < 15) GP[symbol_no] < < = (15 - k); 

} 


printf ("Enter binary input filename"); 
scanf (”%s",decinp); 

printf ("Enter output filename that will contain decoded bits"); 
scanf ("%s",decout); 


fpl = fopen (decinp,"r n ); 
fp2 = fopen (decout,"w“); 

bit_no = 0; 
symbol_no = 0; 
flag = 0; 
n_tb = 0; 

Mo = 14; 

NS = 01 < < Mo; 

NB = NS/2; 

f* set storage of decoded bits to zero */ 


/* open file of received symbols */ 

/* open file for decoder output */ 

/* set bit counter to zero */ 

/* set symbol counter to zero */ 
/* set renormalization flag to zero */ 

/* set starting tb addr. to zero */ 


/* number of states */ 
/* number of butterflies */ 


for time = 0; time < L; timc+ + ) out[time] = 0; 


f* initialize metrics, accumulated metrics, and traceback RAM to zero*/ 

for (state = 0; state < NS; state+ + ) { 
metric[state] = 0; 
old_metric[state] = 0; 
for (time = 0; time < L; time+ + ) 

for (bank = 0; bank < 3; bank+ + ) 
RAM[state][time][bank] = 0; 


/* generate the labels that are assigned to the butterfly */ 
encoder( ); 

/* receive data bits and enter decoder loop */ 

while ((recsym[symbol_no] = getc (fpl)) != EOF) { 
symbol_no+ + ; 

if (symbol_no = = n) { 

symboljno = 0; 
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/* check value of flag to determine to renormalize accumulated metrics */ 

if (Hag = = 0) 

for (state = 0; state < NS; state* + ) 
old_metric [state] = metric [state]; 

else { 

for (state = 0; state < NS; state* + ) 

old_metric[state] = metric [state] &077777; /* clear MSB */ 
flag = 0; 

} 

blk_time = bit_no%L; 

/* check to see if new traceback must be started */ 

if (blk_time = = 0) { 

blk_no = bit_no/L; 
blk_par = blk_no%2; 
tb = blk_no%3; 
dec = (tbf 1)%3; 
n_dec = n_tb; 
n_tb = 0; 

for (time = 0; time < L; time* + ) outr[iime] = out[time]; 

} 

/* determine whether to move left or right through traceback memory */ 

if (blk_par = = 0) time = L - blk_time - 1; 
else time = blk_time; 

f* generate the addresses for the decoded bits and traceback */ 

n_dcc = (n_dec > > 1) I (NB*RAM[n__dec] [time] [dec]); 
n_ib = (n_tb > > 1) I (NB*RAM{n_tb][time][tb]); 

out[blk_time] = (n_dec > > 5)&01; /* extract decoded bits */ 

/* Generate branch metrics associated with new received symbol, add to */ 

/* existing accumulated metrics, determine smallest accumulated metric */ 

/* at current state, and output decision bits to traceback memory */ 

for (butt = 0; butt < NB; butl+ + ) { 

/* compute the two current states of butterfly and their associated previous states */ 

statcO = butt < < 1 ; 
statel = statcO + 1; 
prev_stateO = butt; 
prev statel = prev_state0 I NB; 

metric_comp(recsym); /* call metric computer */ 

miO = old_metric[prev_stateO]; 
mil = old_metric[prev_statel]; 
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add_comp_select(); 


/* call add, compare, and select */ 


metric[stateO] = mjO ; 
metric[statel] = mjl ; 

/* write to traceback RAM, the bits at corresponding state of butterfly */ 

R AM[stateO][time][dec] = bitO; 
RAM[statel][iime]tdec] = bill; 


} 

fprintf (fp2, , '%d",outr[L-blk_time-l]); /* output decoded bits */ 
fflush(fp2); 

bit_no+ + ; /* increment bit counter */ 

) 

} 

} 


/ A A * A A * * A * A A A * * * A A * A A * * * A * * * * * METRIC COMPUTER ***************************** j 


*/ 

/* Thin nub rout, i no computes the branch metrics from the "n" received */ 
/* symbols. */ 
/ * */ 


/AAAAAAAAAAAAAAAAA A ******************************** *************************/ 


metric eomp(rccsym) 
int *rccsym; 

{ 

int symbol_no; 
int sum_rccsym; 
int cncodcd_bit; 
int mag_rccsym; 
int sign_recsym; 


/* loop counter */ 

/* maximum branch metric */ 

/* one bit of branch label */ 

/* received symbol magnitude */ 
/* sign of received symbol */ 


sum_rccsym = 0; /* set branch metric to zero */ 

P = 0; 

for (symbol_no = 0; symbol_no < n; symbol_no+ + ) { 

mag_recsym = recsym[symbol_no] & 0177; /* mask the first eight bits */ 

sign_recsym = (recsym[symbol_no] > > 7) & 01; /* extract sign bit */ 

encodedJ)it = (LABEL[butt] > > symbol_no) & 01; /* strip label bits */ 

sum_recsym + = mag_recsym; /* sum all the received symbol magnitudes */ 
if ((encoded_bit A sign_recsym) = = 01) p + = mag_recsym; 

} 

q = (sum_recsym -p); 


/************************* RDDf COMPARE , AND SELECT ************************/ 
/* */ 
/* Add branch metrics to accumulated metrics. The pair of sums at each */ 
/* of the states is compared and the smallest is selected. The output */ 
/* of each of these decisions is the smallest accumulated metric at each */ 
/* state and the decision bits which are sent to the traceback memory. */ 
/* */ 

/*************************^*************************************************^ 
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add_comp_seIect( ) 


{ 

int sOO, slO, sOl , si 1 ; /* the accumulated metrics */ 

/* add branch metric to accumulated metric */ 

sOO = (miO + p); 
slO = (mil + q); 
sOl = (miO + q); 
si 1 = (mil + p); 

/* determine smallest metric for present two slates of butterfly */ 

if (sOO < si 0) ( bilO = 0; mjO = s()(); ) 
else { bill) = 1 ; mj() = sl(); ) 

if (sOl < si 1) { bill = 0: mjl = s()l; | 
else | bill = 1; mjl = si l; | 

f* cheek constraint length and sol outpul accumulalcd metrics respectively */ 
if (k < 15) mjl = mjO; 

/* deienuine if accumulaied metrics must be renormalized, if so, set flag */ 


I 


if (bull - = 0 && (mjO > > 14) = = 3) Hag = 1; 


/ A A A n U * A U ***** A * * A * * ********** * ENCODER 

/ A 

/ A This subroutine generates the labels f 
/* the appropriate generating polynomials 
/ * 

/***************************************** 


****************★****************/ 

*/ 

or each butterfly by utilizing */ 

*/ 

*/ 


cncoder( ) 


int butt; 
int symbol_no; 
int encoded [6]; 
unsigned int masked; 


/* loop counter */ 

/* loop counter */ 
f* encoded symbols */ 
/* the masked state */ 


/* encode butterfly labels and do appropriate shifting */ 

for (butt = 0; butt < NB; butt+ + ) ( 

for (symbol_no = 0; symbol_no < n; symboLno+ + ) { 
cncodcd[symbol__no] = 0; 

masked = (butt < < 1) & GP[symbol_no]; /* mask the butterfly */ 
for ( ; masked > 0; masked > > = 1) 

cncodcd[symbol_no] A = masked; /* sum the bits of butterfly */ 
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